This is an exploratary analysis on foreign aid provided by USA.This aid given by the United States to other countries to support global peace, security, and development efforts, and provide humanitarian relief during times of crisis. It is a strategic, economic, and moral imperative for the United States and vital to U.S. national security.
You can read more via this link at
wikipedia
Which country gets the highest US aid

- More than 50% of the aid goes to 3 countries i.e Afghanistan, Israel, Iraq.
- Afghanistan and Iraq are likely due to the war and conflicts for a long time and nation rebuilding.
- Not sure on the aid to Israel.
- Pakistan is in the top 5 countries that takes aid for US.

- Middle east and Africa are the top regions where the aid is sent. Iraq, Afghanistan, Iran, Israel are all in middle east.
- Interestingly, none of the African countries are in the earlier top 5 chart.

- Most of the middle east and Africa will fall in the low income category.
- Interesting to see even high income countries are getting aid from USA, will analyze what sort of groups aid such countries.

- Department of Defense (DoD), the constant character in all the Hollywood action/war movies.













- India shows a spike during late 1950’s till 70’s. This is likely due to post India independence era.
- Pakistan is showing the spike at the same time but continues to show an ever increasing trend.
- Interesting to see that India never crossed the billion mark while Pakistan is consistently >1B since year 2000
---
title: "US Foreign Aid Analysis"
output: html_notebook
---

This is an exploratary analysis on foreign aid provided by USA.This aid given by the United States to other countries to support global peace, security, and development efforts, and provide humanitarian relief during times of crisis. It is a strategic, economic, and moral imperative for the United States and vital to U.S. national security.

You can read more via this link at [wikipedia](https://en.wikipedia.org/wiki/United_States_foreign_aid "wikipedia")
<hr>

```{r global_options, include=FALSE}
knitr::opts_chunk$set(fig.width=6.1, fig.height=3,fig.align ="center",echo=FALSE, warning=FALSE, message=FALSE, rows.print=30,cols.print=10)

# fig.asp=0.618

```


```{r load_libs}
library(tidyverse)
library(data.table)
library(gridExtra)
source("/Users/vinaychuri/VINAY/VINAY_R/vinay_functions/vinay_func.R")

# theme_set(theme_gray(12) +  theme(plot.title = element_text(face = "bold"),text = element_text(family = "Arial")))

theme_set(ggthemes::theme_fivethirtyeight(12) +  theme(plot.title = element_text(face = "bold"),text = element_text(family = "Arial")))

```


```{r, include=FALSE}

df<- fread("/Users/vinaychuri/VINAY/dataset/us_foreign_aid/us_foreign_aid_complete.csv")

df<- janitor::clean_names(df)

```

```{r , eval=FALSE,include=FALSE}
str(df)
```


```{r all_functions}
get_counts<- function(in_df,in_cols,topn=10){
  return(in_df[, .(freq=.N), by = c(in_cols)][order(-freq)][1:topn])
  
}

get_top_30_per<-function(in_dt,lov=NA,in_cols=NA,topn=10){
  top_20_df<-data.frame()
  for(x in lov) {
    # use get() on variable
    tmp_df<-in_dt[get(in_cols[1])==x,.(freq=.N),by=c(in_cols)][order(-freq)][1:topn]
    top_20_df<-bind_rows(top_20_df,tmp_df)
    rm(tmp_df)
  }
  return(top_20_df)
  
}


get_aid_summ<-function(data=in_dt,in_col,topn=10,title2=NA){
  tmp_df=df[,.(tot_aid=sum(as.numeric(current_amount))/1e9),by=c(in_col)][order(-tot_aid)]
  tmp_df=tmp_df[get(in_col)!='World',,]
  tmp_df=tmp_df[,"total_percentage":=round(tot_aid/sum(tot_aid)*100,2)]
  # tmp_df=head(tmp_df,topn)
  # tmp_df=tmp_df[,"topn_percentage":=round(tot_aid/sum(tot_aid)*100,2)]
  tmp_df=tmp_df[,"cummulative_percentage":=cumsum(total_percentage)] ## cumulative percentage
  tot_rows<- nrow(tmp_df)
  tmp_df<-head(tmp_df,topn)
  topn<-min(topn,tot_rows)

  # in_col=enquo(in_col) "https://stackoverflow.com/questions/22309285/how-to-use-a-variable-to-specify-column-name-in-ggplot"
  in_col<-ensym(in_col)
  p1<-tmp_df%>%
    ggplot(aes(reorder(!!in_col,tot_aid),tot_aid))+
    geom_col(fill="#4779C7")+
    coord_flip()+
    labs(title=title2,subtitle=paste0("Top ",topn," of ",tot_rows),x="",y="Total aid in billions")+
    scale_y_continuous(labels = scales::comma)
  
  tab1<-as.data.frame(tmp_df%>%slice(0:5))
  
  return(list("p1"=p1,"tab1"=tab1))
  

  
}

```



Which country gets the highest US aid
```{r,fig.width=6.1,fig.height=3}
my_list<-get_aid_summ(data=df,in_col="country_name",topn=15,title2 = "Country wise aid summary")
grid.arrange(my_list$p1,tableGrob(my_list$tab1,theme = ttheme_default(base_size = 10)),heights=c(3,1))


```

* More than 50% of the aid goes to 3 countries i.e Afghanistan, Israel, Iraq.
* Afghanistan and Iraq are likely due to the war and conflicts for a long time and nation rebuilding.
* Not sure on the aid to Israel.
* Pakistan is in the top 5 countries that takes aid for US.

<br><hr>

```{r,fig.width=6.1,fig.height=3}

my_list<-get_aid_summ(data=df,in_col=c("region_name"),topn=15,title2 = "Region wise aid summary")
grid.arrange(my_list$p1,tableGrob(my_list$tab1,theme = ttheme_default(base_size = 10)),heights=c(2,1))



```

* Middle east and Africa are the top regions where the aid is sent. Iraq, Afghanistan, Iran, Israel are all in middle east.
* Interestingly, none of the African countries are in the earlier top 5 chart.

<br><hr>


```{r,fig.width=6.1,fig.height=3}
my_list<-get_aid_summ(data=df,in_col=c("income_group_name"),topn=15,title2 = "income_group_name wise aid summary")
grid.arrange(my_list$p1,tableGrob(my_list$tab1,theme = ttheme_default(base_size = 10)),heights=c(2,1))
 


```

* Most of the middle east and Africa will fall in the low income category.
* Interesting to see even high income countries are getting aid from USA, will analyze what sort of groups aid such countries.


<br><hr>

```{r,fig.width=6.1,fig.height=3}
my_list<-get_aid_summ(data=df,in_col=c("implementing_agency_name"),topn=15,title2 = "implementing_agency_name wise aid summary")
grid.arrange(my_list$p1,tableGrob(my_list$tab1,theme = ttheme_default(base_size = 10)),heights=c(2,1))



```

* Department of Defense (DoD), the constant character in all the Hollywood action/war movies.

<br><hr>

```{r,fig.width=6.1,fig.height=3}

my_list<-get_aid_summ(data=df,in_col=c("subagency_name"),topn=15,title2 = "subagency_name wise aid summary")
grid.arrange(my_list$p1,tableGrob(my_list$tab1,theme = ttheme_default(base_size = 10)),heights=c(2,1))


```

<br><hr>

```{r,fig.width=6.1,fig.height=3}
my_list<-get_aid_summ(data=df,in_col=c("channel_category_name"),topn=15,title2 = "channel_category_name wise aid summary")
grid.arrange(my_list$p1,tableGrob(my_list$tab1,theme = ttheme_default(base_size = 10)),heights=c(2,1))


```

<br><hr>


```{r,fig.width=6.1,fig.height=3}
my_list<-get_aid_summ(data=df,in_col=c("channel_subcategory_name"),topn=15,title2 = "channel_subcategory_name wise aid summary")
grid.arrange(my_list$p1,tableGrob(my_list$tab1,theme = ttheme_default(base_size = 10)),heights=c(2,1))



```


<br><hr>

```{r,fig.width=6.1,fig.height=3}
my_list<-get_aid_summ(data=df,in_col=c("dac_category_name"),topn=15,title2 = "dac_category_name wise aid summary")
grid.arrange(my_list$p1,tableGrob(my_list$tab1,theme = ttheme_default(base_size = 10)),heights=c(2,1))


 
```


<br><hr>

```{r,fig.width=6.1,fig.height=3}
my_list<-get_aid_summ(data=df,in_col=c("dac_sector_name"),topn=15,title2 = "dac_sector_name wise aid summary")
grid.arrange(my_list$p1,tableGrob(my_list$tab1,theme = ttheme_default(base_size = 10)),heights=c(2,1))


 
```


<br><hr>

```{r,fig.width=6.1,fig.height=3}
my_list<-get_aid_summ(data=df,in_col=c("dac_purpose_name"),topn=15,title2 = "dac_purpose_name wise aid summary")
grid.arrange(my_list$p1,tableGrob(my_list$tab1,theme = ttheme_default(base_size = 10)),heights=c(2,1))



```


<br><hr>

```{r,fig.width=6.1,fig.height=3}
my_list<-get_aid_summ(data=df,in_col=c("funding_account_name"),topn=15,title2 = "funding_account_name wise aid summary")
grid.arrange(my_list$p1,tableGrob(my_list$tab1,theme = ttheme_default(base_size = 10)),heights=c(2,1))


```


<br><hr>

```{r,fig.width=6.1,fig.height=3}
my_list<-get_aid_summ(data=df,in_col=c("funding_agency_name"),topn=15,title2 = "funding_agency_name wise aid summary")
grid.arrange(my_list$p1,tableGrob(my_list$tab1,theme = ttheme_default(base_size = 10)),heights=c(2,1))




```


<br><hr>

```{r,fig.width=6.1,fig.height=3}
my_list<-get_aid_summ(data=df,in_col=c("aid_type_group_name"),topn=15,title2 = "aid_type_group_name wise aid summary")
grid.arrange(my_list$p1,tableGrob(my_list$tab1,theme = ttheme_default(base_size = 10)),heights=c(2,1))



```


<br><hr>

```{r,fig.width=6.1,fig.height=3}
my_list<-get_aid_summ(data=df,in_col=c("activity_name"),topn=15,title2 = "activity_name wise aid summary")
grid.arrange(my_list$p1,tableGrob(my_list$tab1,theme = ttheme_default(base_size = 10)),heights=c(2,1))



```

<br><hr>


```{r,fig.width=6.1,fig.height=3}
my_list<-get_aid_summ(data=df,in_col=c("usg_sector_name"),topn=15,title2 = "usg_sector_name wise aid summary")
grid.arrange(my_list$p1,tableGrob(my_list$tab1,theme = ttheme_default(base_size = 10)),heights=c(2,1))



```


<br><hr>

```{r}
my_list<-get_aid_summ(data=df,in_col=c("assistance_category_name"),topn=15,title2 = "assistance_category_name wise aid summary")
grid.arrange(my_list$p1,tableGrob(my_list$tab1,theme = ttheme_default(base_size = 10)),heights=c(2,1))


```


<br><hr>

```{r,fig.width=6.1,fig.height=3}
df%>%
  filter(country_name %in% c("India","Pakistan"))%>%
  group_by(country_name,fiscal_year)%>%
  summarise(tot_aid=sum(as.numeric(current_amount))/1e9)%>%    # 1e9- billion ,1e6=million
  mutate(fiscal_year=as.numeric(ifelse(fiscal_year=='1976tq',"1976",fiscal_year)))%>%
  ggplot(aes(fiscal_year,tot_aid,color=country_name))+
  geom_line()+
  labs(title="Trends of US foreign aid to India and Pakistan",y="in billions",x="")+
  theme(axis.text.x = element_text(angle = 90, hjust = 1))

# paste(round(x / 1e6, 1), "M")
```

* India shows a spike during late 1950's till 70's. This is likely due to post India independence era.
* Pakistan is showing the spike at the same time but continues to show an ever increasing trend.
* Interesting to see that India never crossed the billion mark while Pakistan is consistently >1B since year 2000


```{r , include=FALSE,eval=FALSE,fig.width=6.1}
# Sat Aug 17 17:11:16 2019 -----------------------COMMENTSHERE-----------------------

df[,.(tot_aid=sum(as.numeric(current_amount)/1000000)),by=.(implementing_agency_name)][order(-tot_aid)][1:5]

dt1<-df[,.(tot_aid=sum(as.numeric(current_amount))),by=.(implementing_agency_name)][order(-tot_aid)][1:5]
dt2<-df%>%
  group_by(implementing_agency_name)%>%
  summarise(tot_aid=sum(current_amount))%>%
  filter(implementing_agency_name!='World')%>%
  arrange(-tot_aid)%>%
  slice(1:5)%>%
  mutate(tot_aid=tot_aid/1000000)

res=microbenchmark::microbenchmark(dt1,dt2,times=100)
autoplot(res)
plot(res)


table(df$fiscal_year)

df%>%
  filter(country_name %in% c("India","Pakistan"))%>%
  group_by(country_name,fiscal_year)%>%
  summarise(tot_aid=sum(as.numeric(current_amount))/1000000)%>%
  reshape2::dcast(fiscal_year~country_name,value.var="tot_aid",fill=0)%>%
  ggplot(aes(fiscal_year))+
  geom_line(aes(y = "tot_aid"),group=1, colour = "blue")+
  geom_line(aes(y = "tot_aid"),group=1, colour = "red")+
  theme(axis.text.x = element_text(angle = 90, hjust = 1))
  # theme(legend.position = "botton")



df%>%
  filter(country_name %in% c("India","Pakistan"))%>%
  group_by(country_name,fiscal_year)%>%
  summarise(tot_aid=sum(as.numeric(current_amount))/1000000)%>%
  reshape2::dcast(fiscal_year~country_name,value.var="tot_aid",fill=0)%>%
  ggplot(aes(fiscal_year,India,color="blue"))+
  geom_line(group=1)+
  geom_line(aes(fiscal_year,Pakistan,color="red"),group=1)+
  theme(axis.text.x = element_text(angle = 90, hjust = 1))



df%>%
  filter(country_name %in% c("India","Pakistan"))%>%
  group_by(country_name,fiscal_year)%>%
  summarise(tot_aid=sum(as.numeric(current_amount))/1000000)%>%
  mutate(fiscal_year=as.numeric(ifelse(fiscal_year=='1976tq',"1976",fiscal_year)))%>%
  ggplot(aes(fiscal_year,tot_aid,color=country_name))+
  geom_line()+
  theme(axis.text.x = element_text(angle = 90, hjust = 1))



library(tidyverse)
library(lubridate)
library(ggthemes)
library(ggrepel)
library(gganimate)



df%>%
  filter(country_code %in% c("IN","PAK"))%>%
  group_by(country_code,fiscal_year)%>%
  summarise(tot_aid=sum(as.numeric(current_amount))/1000000)%>%
  mutate(fiscal_year=as.numeric(ifelse(fiscal_year=='1976tq',"1976",fiscal_year)))%>%
  ungroup()%>%
  group_by(fiscal_year) %>%
  mutate(ordering = rank(tot_aid)) %>%
  ungroup()->plotdata

  
  


# plotdata <- df.GDPpercap %>%
#   group_by(year) %>%
#   mutate(ordering = rank(GDPpercap)) %>%
#   ungroup() 
# 
p<-ggplot(plotdata,
       aes(ordering, group = country_code,color=country_code,fill=country_code)) +
  geom_tile(aes(y = tot_aid/2,height = tot_aid, width = 0.9), alpha = 0.9) +
  # text on top of bars
  geom_text(aes(y = tot_aid, label = country_code), hjust = -0.4) +
  # text in x-axis (requires clip = "off" in coord_cartesian)
  geom_text(aes(y = 0, label = country_code), hjust = 1.4) +
  coord_flip(clip = "off", expand = FALSE) +
  scale_color_viridis_d(name="")+
  scale_fill_viridis_d(name="")+
  scale_y_continuous(labels=scales::dollar)+
  theme_tufte(14,"Avenir")+
  guides(color=F,fill=F)+
  labs(title='{frame_time}', x = "",y="US Foreign aid ($2012)") +
  theme(plot.title = element_text(hjust = 1, size = 22),
        axis.ticks.y = element_blank(),
        axis.text.y  = element_blank()) + 
  transition_time(year)+
  ease_aes('cubic-in-out')

animate(p, nframes = 100, fps = 5, end_pause = 20) #again, use anim_save(filename) to save




```


